-
Notifications
You must be signed in to change notification settings - Fork 600
[GPT-OSS-120B] Reference implementation #2395
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
add better parsing and check for harmony tokens
|
|
||
|
|
||
| def decode_hex_to_tokens(hex_data: str) -> List[int]: | ||
| """Decode hex string to list of token IDs (int32). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not a concern here - we added int32 after int64 didnt work so ds-r1 code has support for both while defaulting to int32:
eval_dtype = np.int32
|
|
||
| total_correct = sum(stats["correct"] for stats in dataset_stats.values()) | ||
| total_samples = sum(stats["total"] for stats in dataset_stats.values()) | ||
| overall_accuracy = ( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
consider putting formulas on one line, if block on next
similar below (final-score-percentage)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
IIUC these are linux soft links?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah that is right
|
@arjunsuresh @mrmhodak can we try to get this merged soon? 🙏 |
|
@v-shobhit once the WG approves I can merge it. I'm told to not do any merge outside of the WG discussions 😕 |
Zhihan confirmed this isn’t an issue and will address it after the merge, if needed.
|
@anandhu-eng will create an empty commit as a workaround for this checker issue. This problem ultimately needs to be resolved by the Automation Work Task Force. |
|
PR is ready to be merged once approved. |
hanyunfan
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As we talked in WG, this is approved.
|
@v-shobhit merged. thanks for all the hard work! |
Adding gpt-oss-120b reference implementation.
This uses SGLang backend to serve gpt-oss-120b model. Scripts are provided to run this in Offline/Server scenario in PerformanceOnly/AccuracyOnly modes.